Automatic Chinese Summarization Method Based on the HowNet and Clustering Algorithm
نویسندگان
چکیده
To solve the problems in traditional automatic Chinese summarization, a new method based on the word concept and clustering is presented in this paper. Different from the normal statistical method, concept is used as feature instead of word. Also, instead of word frequency statistics, word concept frequency statistics (WCFS) is used in our approach. For each paragraph, a conceptual vector space model is established, and then the clustering algorithm is used for multiple topic partition. The evaluation results show that the method proposed in this paper is more efficient and robust than the traditional one.
منابع مشابه
Automatic Text Summarization Based on Lexical Chains
The method of lexical chains is the first time introduced to generate summaries from Chinese texts. The algorithm which computes lexical chains based on the HowNet knowledge database is modified to improve the performance and suit Chinese summarization. Moreover, the construction rules of lexical chains are extended, and relationship among more lexical items is used. The algorithm constructs le...
متن کاملBiogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization
Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...
متن کاملAutomatic Summarization for Chinese Text Using Affinity Propagation Clustering and Latent Semantic Analysis
As the rapid development of the internet, we can collect more and more information. it also means we need the abitily to search the information which really useful to us from the amount of information quickly. Automatic summarization is useful to us for handling the huge amount of text information in the Web. This paper proposes a Chinese summarization method based on Affinity Propagation(AP)cl...
متن کاملAn Unsupervised Approach to Chinese Word Sense Disambiguation Based on Hownet
The research on word sense disambiguation (WSD) has great theoretical and practical significance in many fields of natural language processing (NLP). This paper presents an unsupervised approach to Chinese word sense disambiguation based on Hownet (an electronic Chinese lexical resource). In our approach, contexts that include ambiguous words are converted into vectors by means of a second-orde...
متن کاملخوشهبندی خودکار دادهها با بهرهگیری از الگوریتم رقابت استعماری بهبودیافته
Imperialist Competitive Algorithm (ICA) is considered as a prime meta-heuristic algorithm to find the general optimal solution in optimization problems. This paper presents a use of ICA for automatic clustering of huge unlabeled data sets. By using proper structure for each of the chromosomes and the ICA, at run time, the suggested method (ACICA) finds the optimum number of clusters while optim...
متن کامل